373 research outputs found

    p3d – Python module for structural bioinformatics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>High-throughput bioinformatic analysis tools are needed to mine the large amount of structural data via knowledge based approaches. The development of such tools requires a robust interface to access the structural data in an easy way. For this the Python scripting language is the optimal choice since its philosophy is to write an understandable source code.</p> <p>Results</p> <p>p3d is an object oriented Python module that adds a simple yet powerful interface to the Python interpreter to process and analyse three dimensional protein structure files (PDB files). p3d's strength arises from the combination of a) very fast spatial access to the structural data due to the implementation of a binary space partitioning (BSP) tree, b) set theory and c) functions that allow to combine a and b and that use human readable language in the search queries rather than complex computer language. All these factors combined facilitate the rapid development of bioinformatic tools that can perform quick and complex analyses of protein structures.</p> <p>Conclusion</p> <p>p3d is the perfect tool to quickly develop tools for structural bioinformatics using the Python scripting language.</p

    Quantitative test of the barrier nucleosome model for statistical positioning of nucleosomes up- and downstream of transcription start sites

    Get PDF
    The positions of nucleosomes in eukaryotic genomes determine which parts of the DNA sequence are readily accessible for regulatory proteins and which are not. Genome-wide maps of nucleosome positions have revealed a salient pattern around transcription start sites, involving a nucleosome-free region (NFR) flanked by a pronounced periodic pattern in the average nucleosome density. While the periodic pattern clearly reflects well-positioned nucleosomes, the positioning mechanism is less clear. A recent experimental study by Mavrich et al. argued that the pattern observed in S. cerevisiae is qualitatively consistent with a `barrier nucleosome model', in which the oscillatory pattern is created by the statistical positioning mechanism of Kornberg and Stryer. On the other hand, there is clear evidence for intrinsic sequence preferences of nucleosomes, and it is unclear to what extent these sequence preferences affect the observed pattern. To test the barrier nucleosome model, we quantitatively analyze yeast nucleosome positioning data both up- and downstream from NFRs. Our analysis is based on the Tonks model of statistical physics which quantifies the interplay between the excluded-volume interaction of nucleosomes and their positional entropy. We find that although the typical patterns on the two sides of the NFR are different, they are both quantitatively described by the same physical model, with the same parameters, but different boundary conditions. The inferred boundary conditions suggest that the first nucleosome downstream from the NFR (the +1 nucleosome) is typically directly positioned while the first nucleosome upstream is statistically positioned via a nucleosome-repelling DNA region. These boundary conditions, which can be locally encoded into the genome sequence, significantly shape the statistical distribution of nucleosomes over a range of up to ~1000 bp to each side.Comment: includes supporting materia

    From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions

    Get PDF
    ©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.doi:10.1371/journal.pcbi.1000341DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Ca deviation from native is up to 5 Å from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein

    Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures

    Get PDF
    DNA-binding proteins such as transcription factors use DNA-binding domains (DBDs) to bind to specific sequences in the genome to initiate many important biological functions. Accurate prediction of such target sequences, often represented by position weight matrices (PWMs), is an important step to understand many biological processes. Recent studies have shown that knowledge-based potential functions can be applied on protein-DNA co-crystallized structures to generate PWMs that are considerably consistent with experimental data. However, this success has not been extended to DNA-binding proteins lacking co-crystallized structures. This study aims at investigating the possibility of predicting the DNA sequences bound by DNA-binding proteins from the proteins' unbound structures (structures of the unbound state). Given an unbound query protein and a template complex, the proposed method first employs structure alignment to generate synthetic protein-DNA complexes for the query protein. Once a complex is available, an atomic-level knowledge-based potential function is employed to predict PWMs characterizing the sequences to which the query protein can bind. The evaluation of the proposed method is based on seven DNA-binding proteins, which have structures of both DNA-bound and unbound forms for prediction as well as annotated PWMs for validation. Since this work is the first attempt to predict target sequences of DNA-binding proteins from their unbound structures, three types of structural variations that presumably influence the prediction accuracy were examined and discussed. Based on the analyses conducted in this study, the conformational change of proteins upon binding DNA was shown to be the key factor. This study sheds light on the challenge of predicting the target DNA sequences of a protein lacking co-crystallized structures, which encourages more efforts on the structure alignment-based approaches in addition to docking- and homology modeling-based approaches for generating synthetic complexes

    Gauge symmetry and W-algebra in higher derivative systems

    Full text link
    The problem of gauge symmetry in higher derivative Lagrangian systems is discussed from a Hamiltonian point of view. The number of independent gauge parameters is shown to be in general {\it{less}} than the number of independent primary first class constraints, thereby distinguishing it from conventional first order systems. Different models have been considered as illustrative examples. In particular we show a direct connection between the gauge symmetry and the W-algebra for the rigid relativistic particle.Comment: 1+22 pages, 1 figure, LaTeX, v2; title changed, considerably expanded version with new results, to appear in JHE

    A mathematical and computational review of Hartree-Fock SCF methods in Quantum Chemistry

    Get PDF
    We present here a review of the fundamental topics of Hartree-Fock theory in Quantum Chemistry. From the molecular Hamiltonian, using and discussing the Born-Oppenheimer approximation, we arrive to the Hartree and Hartree-Fock equations for the electronic problem. Special emphasis is placed in the most relevant mathematical aspects of the theoretical derivation of the final equations, as well as in the results regarding the existence and uniqueness of their solutions. All Hartree-Fock versions with different spin restrictions are systematically extracted from the general case, thus providing a unifying framework. Then, the discretization of the one-electron orbitals space is reviewed and the Roothaan-Hall formalism introduced. This leads to a exposition of the basic underlying concepts related to the construction and selection of Gaussian basis sets, focusing in algorithmic efficiency issues. Finally, we close the review with a section in which the most relevant modern developments (specially those related to the design of linear-scaling methods) are commented and linked to the issues discussed. The whole work is intentionally introductory and rather self-contained, so that it may be useful for non experts that aim to use quantum chemical methods in interdisciplinary applications. Moreover, much material that is found scattered in the literature has been put together here to facilitate comprehension and to serve as a handy reference.Comment: 64 pages, 3 figures, tMPH2e.cls style file, doublesp, mathbbol and subeqn package

    Azimuthal anisotropy and correlations at large transverse momenta in p+pp+p and Au+Au collisions at sNN\sqrt{s_{_{NN}}}= 200 GeV

    Get PDF
    Results on high transverse momentum charged particle emission with respect to the reaction plane are presented for Au+Au collisions at sNN\sqrt{s_{_{NN}}}= 200 GeV. Two- and four-particle correlations results are presented as well as a comparison of azimuthal correlations in Au+Au collisions to those in p+pp+p at the same energy. Elliptic anisotropy, v2v_2, is found to reach its maximum at pt3p_t \sim 3 GeV/c, then decrease slowly and remain significant up to pt7p_t\approx 7 -- 10 GeV/c. Stronger suppression is found in the back-to-back high-ptp_t particle correlations for particles emitted out-of-plane compared to those emitted in-plane. The centrality dependence of v2v_2 at intermediate ptp_t is compared to simple models based on jet quenching.Comment: 4 figures. Published version as PRL 93, 252301 (2004

    Azimuthal anisotropy in Au+Au collisions at sqrtsNN = 200 GeV

    Get PDF
    The results from the STAR Collaboration on directed flow (v_1), elliptic flow (v_2), and the fourth harmonic (v_4) in the anisotropic azimuthal distribution of particles from Au+Au collisions at sqrtsNN = 200 GeV are summarized and compared with results from other experiments and theoretical models. Results for identified particles are presented and fit with a Blast Wave model. Different anisotropic flow analysis methods are compared and nonflow effects are extracted from the data. For v_2, scaling with the number of constituent quarks and parton coalescence is discussed. For v_4, scaling with v_2^2 and quark coalescence is discussed.Comment: 26 pages. As accepted by Phys. Rev. C. Text rearranged, figures modified, but data the same. However, in Fig. 35 the hydro calculations are corrected in this version. The data tables are available at http://www.star.bnl.gov/central/publications/ by searching for "flow" and then this pape
    corecore